(57) Abstract

The present invention relates to the modification of immunoglobulin superfamily (IgSF) domains,
IgSF fragments and fusion proteins thereof, especially to the modification of antibody
derivatives, so as to improve their solubility, and hence the yield, and ease of handling. The
inventors have found that this can be achieved by making the region which comprises the
interface with domains adjoined to said IgSF domain in a larger fragment or a full IgSF protein,
and which becomes exposed in the IgSF domain, more hydrophilic by modification. The present
invention describes DNA sequences encoding modified IgSF domains or fragments and fusion
proteins, vectors and hosts containing these DNA sequences, IgSF domains or fragments or fusion
proteins obtainable by expressing said DNA sequences in suitable expression systems, and a
method for modifying IgSF domains, so as to improve their solubility, expressibility and ease
of handling.
Immunoglobulin Superfamily Domains and Fragments with Increased
Solubility



Field of the Invention 

The present invention relates to the modification of immunoglobulin superfamily (IgSF)
domains and derivatives thereof so as to increase their solubility, and hence the yield, and ease 
of handling. 



Background to the Invention 

Small antibody fragments show exciting promise for use as therapeutic agents, diagnostic
reagents, and for biochemical research. Thus, they are needed in large amounts, and the
expression of antibody fragments, e.g. Fv, single-chain Fv (scFv), or Fab in the periplasm of
E. coli (Skerra & Plückthun, 1988; Better et al., 1988) is now used routinely in many
laboratories. Expression yields vary widely, however, especially in the case of scFvs. While
some fragments yield up to several mg of functional, soluble protein per litre and OD of
culture broth in shake flask culture (Carter et al., 1992; Plückthun et al., 1996), other
fragments may almost exclusively lead to insoluble material, often found in so-called
inclusion bodies. Functional protein may be obtained from the latter in modest yields by a
laborious and time-consuming refolding process. The factors influencing antibody expression
levels are still only poorly understood. Folding efficiency and stability of the antibody
fragments, protease lability and toxicity of the expressed proteins to the host cells often
severely limit actual production levels, and several attempts have been made to increase
expression yields. For example, Knappik & Plückthun (1995) have identified key residues in
the antibody framework which influence expression yields dramatically. Similarly, Ullrich et
al. (1995) found that point mutations in the CDRs can increase the yields in periplasmic
antibody fragment expression. Nevertheless, these strategies are only applicable to a few
antibodies.

The observations by Knappik & Plückthun (1995) indicate that optimizing those parts of the
antibody fragment which are not directly involved in antigen recognition can significantly
improve folding properties and production yields of recombinant Fv and scFv constructs. The
causes for the improved expression behavior lie in the decreased aggregation behavior of these
molecules. For other molecules, fragment stability and protease resistance may also be
affected. The understanding of how specific sequence modifications change these properties is
still very limited and currently under active investigation.

Difficulties in expressing and manipulating protein domains may arise because amino acids
which are normally buried within the protein structure become exposed when only a portion of
the whole molecule is expressed. Aggregation may occur through interaction of newly
solvent-exposed hydrophobic residues originally forming the contact regions between adjacent
domains. Leistler and Perham (1994) could show that a certain domain of glutathione
reductase may be expressed separately from its neighboring domains, but the protein showed
non-specific association in vitro, forming multimeric protein species. The introduction of
hydrophilic residues instead of exposed hydrophobic amino acids could decrease this
aggregation tendency and thus stabilize this isolated domain. Both wild type and modified
domains were exclusively found in inclusion bodies and had to be refolded. Although in vitro
experiments contributed a lot to define various intermolecular interactions, which drive
folding processes, they are only of limited value in predicting the folding behaviour of
different polypeptide chains in vivo (Gething & Sambrook, 1992). Thus, Leistler and Perham
do not teach or suggest how to increase expression yields of soluble protein domains.

In the case of antibodies, two chains comprising several domains dimerize, each domain
consisting of a β-barrel whose two β-sheets are held together by a disulphide bond, forming
the so-called immunoglobulin fold. Two domains, one variable domain (VL) and one constant
domain (CL), are adjacent along the longitudinal axis in the light chain (VL-CL), and four
domains, one variable domain (VH) and three constant domains (CH1 to CH3), are adjacent
along the longitudinal axis in the heavy chain (VH-CH1-CH2-CH3). In the dimer formed by
chains a and b, two such domains associate laterally: VLa with VHa, CLa with CH1a, VLb
with VHb, CLb with CH1b, CH2a with CH2b and CH3a with CH3b. In WO 92/01787
(Johnson et al., 1992), it is taught that isolated single domains, e.g. VH, can be modified in
the former VL/VH interface region by exchanging hydrophobic residues by hydrophilic ones
without changing the specificity of the parent domain. The rationale for WO 92/01787 was the
assumption that exposed hydrophobic residues might lead to non-specific binding, interaction
with surfaces and decreased stability. Data for increase in binding specificity was given, but
increase in expression level was not shown. Furthermore, WO 92/01787 would not be
applicable to any antibody fragment containing the complete antigen binding site, as it must
contain VL and VH.

In the case of T cell receptors, two chains (α and β) dimerize, each consisting of a variable
(V) and a constant (C) domain with the immunoglobulin fold, and one transmembrane
domain. In each chain, the variable and constant domains are adjacent along the longitudinal
axis in the chains (Vα-Cα; Vβ-Cβ) and associate laterally with the corresponding domains of
the second chain (Vα-Vβ; Cα-Cβ).

Various other molecules of the immunoglobulin superfamily, such as CD2, CD4, CD16 and
CD22, comprise only one chain, wherein two or more domains (variable and/or constant) with
the immunoglobulin fold are adjacent along the longitudinal axis in the chains.

The present inventors have found that expression problems are largely associated with a part 
of the molecule that has hitherto not been regarded relevant for expression studies and which 
comprises the interface between adjacent domains within an immunoglobulin chain. This 
surprising finding forms the basis of the present invention, which provides a general solution 
to the problems associated with production of domains or fragments of the immunoglobulin 
superfamily (IgSF), especially antibody fragments, which exhibit poor solubility or reduced
levels of expression. 



Detailed Description of the Invention

In addition to lateral interactions between domains of different chains described above, there
are well documented contacts between adjacent domains within individual chains along the
longitudinal axis. For example, in the case of an antibody (Lesk & Chothia, 1988), the
"bottom" of VL makes contact with the "top" of CL, and, in a similar manner, there are
contacts between VH and CH1. The contacts at these inter-domain interfaces are probably
essential for the compact arrangement of the Fab fragment, and, as is typical for such contacts,
are at least partially hydrophobic in nature (Lesk & Chothia, 1988).

The basis of the present invention is the surprising finding that the solubility (and hence the
yield) of antibody fragments comprising at least one domain can be dramatically increased by
decreasing the hydrophobicity of former interfaces at the "end" of said domain, where it would
normally adjoin a second domain within a chain in a larger antibody fragment or full antibody.
This is surprising and could not have been predicted from the prior art (WO 92/01787),
because the size of the longitudinal interface, for example, in a scFv fragment, is much smaller
than that between VH and VL. Therefore, the amino acids which make up the interfaces
between VH and CH1 or between VL and CL in a Fab fragment represent a much smaller
proportion of the total surface area of the scFv molecule, and would accordingly be expected
to play less of a role in determining the physical properties of the molecule.

The present invention has the additional advantage that because the alterations effected in the
molecules that lead to said decreased hydrophobicity of former interfaces are located at the
most distant part of the domain from the CDRs, applying the invention is unlikely to have a
deleterious effect on the binding properties of the molecule. This is not the case in WO
92/01787, where at least one modification is close to the CDRs and may therefore be expected
to have an impact on antigen binding. Furthermore, WO 92/01787 cannot be applied to
VL/VH heterodimers, as explained above.

The present invention relates to a modified immunoglobulin superfamily (IgSF) domain or
fragment which differs from a parent IgSF domain or fragment in that the region which
comprised or would comprise the interface with a second domain adjoined to said parent IgSF
domain or fragment within the protein chain of a larger IgSF fragment or a full IgSF protein,
and which is exposed in said parent IgSF domain or fragment in the absence of said second
domain, is made more hydrophilic by modification.

In the context of the present invention, the term immunoglobulin superfamily (IgSF) domain
refers to those parts of members of the immunoglobulin superfamily which are characterized
by the immunoglobulin fold, said superfamily comprising the immunoglobulins or antibodies,
and various other proteins such as T-cell receptors or integrins. The term IgSF fragment refers
to any portion of a member of the immunoglobulin superfamily, said portion comprising at
least one IgSF domain. The term adjoining domain refers to a domain which is contiguous
with a first domain. The term interface refers to a region of said first domain where interaction
with the adjoining domain takes place. The terms hydrophobic and hydrophilic refer to a
physical property of amino acids, which can be estimated quantitatively: tabulated values of
hydrophobicity for the twenty naturally-occurring amino acids are available (Nozaki &
Tanford, 1971; Casari & Sippl, 1992; Rose & Wolfenden, 1993).

The residues to be modified can be identified in a number of ways. For example, in one way,
the solvent accessibilities (Lee & Richards, 1971) of hydrophobic interface residues in said
parent IgSF fragment compared to said larger IgSF fragment or full IgSF protein are
calculated, with high accessibilities indicating highly exposed residues. In a second way, the
number of van der Waals contacts of hydrophobic interface residues in said larger IgSF
fragment or full IgSF protein is calculated. A large number for a residue of said parent domain
indicates that it will be highly solvent-exposed in the absence of an adjoining domain. There
are other ways of calculating or determining residues to be modified according to the present
invention, and one of ordinary skill in the art will be able to identify and practice these ways.
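
The first of these approaches can be illustrated with a minimal sketch. It assumes that per-residue side-chain accessibilities have already been computed for the isolated fragment and for the larger fragment; the position labels, the numerical values, the 0.25 threshold and the set of residue types treated as hydrophobic are hypothetical placeholders chosen for illustration, not data from this specification.

```python
# Residue types treated as hydrophobic here (an assumption for illustration).
HYDROPHOBIC = {"ALA", "VAL", "LEU", "ILE", "MET", "PHE", "TRP", "PRO"}


def relative_burial(asa_fragment, asa_full):
    """Fraction of the side-chain accessibility lost when the adjoining domain is present."""
    if asa_fragment <= 0.0:
        return 0.0
    return (asa_fragment - asa_full) / asa_fragment


def exposed_interface_residues(asa_fragment, asa_full, residue_types, threshold=0.25):
    """Return hydrophobic positions whose accessibility changes by at least `threshold`."""
    hits = []
    for position, exposed in asa_fragment.items():
        change = relative_burial(exposed, asa_full[position])
        if change >= threshold and residue_types[position] in HYDROPHOBIC:
            hits.append((position, round(change, 2)))
    # Largest relative change first.
    return sorted(hits, key=lambda item: item[1], reverse=True)


# Made-up accessibilities (in square angstroms) for four placeholder positions.
asa_fv = {"P1": 80.0, "P2": 60.0, "P3": 40.0, "P4": 90.0}
asa_fab = {"P1": 20.0, "P2": 30.0, "P3": 40.0, "P4": 30.0}
types = {"P1": "LEU", "P2": "LEU", "P3": "LEU", "P4": "SER"}

print(exposed_interface_residues(asa_fv, asa_fab, types))
```

In this toy input, P3 is hydrophobic but not at the interface, and P4 is at the interface but already hydrophilic, so only P1 and P2 are reported.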

By analyzing computer models of said parent IgSF fragment, interactions of said highly 
exposed residues within the fragment can be identified. Such interactions could stabilize the 
parent IgSF fragment. Residues which interact closely with other hydrophobic residues, and
which can be identified by anyone of ordinary skill in the art, should preferably not be
mutated.

The modification referred to above may be effected in a number of ways which are well
known to one skilled in the art. In a preferred embodiment, the modification is a substitution
of one or more amino acids at the exposed interface, identified as described above, with amino
acids which are more hydrophilic. Alternatively, one or more amino acids can be inserted in
said interface, or one or more amino acids can be deleted from said interface, so as to increase
its overall hydrophilicity. Furthermore, any combination of substitution, insertion and deletion
can be effected to reduce the hydrophobicity of said interface. Also comprised by the present
invention is the possibility that the substitution or insertion comprises amino acids with a
relatively high hydrophobicity value, or that the deletion comprises amino acids with relatively
low hydrophobicity value, as long as the overall hydrophilicity value is increased in said
interface region. Modifications such as substitution, insertion and deletion can be effected
using standard methods which are well known to practitioners skilled in the art. By way of
example, the skilled artisan can use either site-directed or PCR-based mutagenesis (Ho et al.,
1989; Kunkel et al., 1991; Trower, 1994; Viville, 1994), or total gene synthesis (Prodromou &
Pearl, 1992) to effect the necessary modification(s). In a further embodiment, the mutations
may be obtained by random mutagenesis and screening of random mutants, using a suitable
expression and screening system (see, for example, Stemmer, 1994; Crameri et al., 1996).
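
The condition that the overall hydrophilicity of the interface must increase, even when an individual change is hydrophobic, can be sketched as follows. This is only an illustration under stated assumptions: it uses the Kyte-Doolittle hydropathy scale as a stand-in for the tabulated scales cited above, it takes the summed hydropathy of the interface residues as the "overall" value, and the four-residue interface is hypothetical.

```python
# Kyte-Doolittle hydropathy values (higher = more hydrophobic).
# Assumption: this scale is used as a stand-in; the text cites other scales.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def interface_hydrophobicity(residues):
    """Summed hydropathy of the interface residues (one-letter codes)."""
    return sum(KYTE_DOOLITTLE[res] for res in residues)


def is_more_hydrophilic(parent, modified):
    """True if the modified interface has a lower summed hydropathy than the parent."""
    return interface_hydrophobicity(modified) < interface_hydrophobicity(parent)


parent = "LLVI"          # hypothetical exposed interface residues
substituted = "DLVI"     # Leu -> Asp substitution
combined = "DLVIA"       # same substitution plus insertion of a hydrophobic Ala

print(is_more_hydrophilic(parent, substituted))
print(is_more_hydrophilic(parent, combined))
```

The second case mirrors the point made in the text: an inserted residue may itself be hydrophobic as long as the interface as a whole becomes more hydrophilic.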

In a preferred embodiment, the amino acid(s) which replace(s) the more hydrophobic amino
acids include Asn, Asp, Arg, Gln, Glu, Gly, His, Lys, Ser, and Thr. These are among the more
hydrophilic of the 20 naturally-occurring amino acids, and have proven to be particularly
effective in the application of the present invention. Said amino acids, alone or in
combination, or in combination with other amino acids, can also be used to form the above
mentioned insertion which makes the interface region more hydrophilic.

The parent IgSF domain or fragment referred to above can be one of several different types. In 
a preferred embodiment, said parent domain or fragment is derived from an antibody. In one 
embodiment, said parent antibody fragment comprises an Fv fragment. In this context, the 
term Fv fragment refers to a complex comprising the VL (variable light) and VH (variable
heavy) portions of the antibody molecule. In a further embodiment, the parent antibody
fragment may be a single-chain Fv fragment (scFv; Bird et al., 1988; Huston et al., 1988), in
which the VL and VH chains are joined, in either a VL-VH or VH-VL orientation, by a
peptide linker. In yet a further embodiment, the parent antibody fragment may be an Fv
fragment stabilized by an inter-domain disulphide bond. This is a structure which can be made
by engineering into each chain a single cysteine residue, wherein said cysteine residues from
two chains become linked through oxidation to form a disulphide (Glockshuber et al., 1990;
Brinkmann et al., 1993).

In a most preferred embodiment, the interface region of the variable domains mentioned above
comprises residues 9, 10, 12, 15, 39, 40, 41, 80, 81, 83, 103, 105, 106, 106A, 107, 108 for VL,
and residues 9, 10, 11, 13, 14, 41, 42, 43, 84, 87, 89, 105, 108, 110, 112, 113 for VH
according to the Kabat numbering system (Kabat et al., 1991). Said numbering system was
established for the sequences of whole antibodies, but can be adapted correspondingly to
describe the sequences of isolated antibody domains or antibody fragments, even in the case of
scFv fragments, where VL and VH are connected via a peptide linker, and where the protein
sequence from N- to C-terminus has to be numbered differently. This means that the Kabat
numbering system is used in the present invention as a sequence description relative to the
existing data on antibody sequences, not as an absolute description of actual positions within
the antibody fragment sequences of interest.
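
Because Kabat positions can carry an insertion letter (106A above), they are more safely handled as labels than as plain integers. The following minimal sketch stores the two position lists given above and tests membership; the function and variable names are illustrative assumptions, and the parser assumes a label consisting of digits followed by an optional letter.

```python
# Interface positions as listed in the text, kept as strings because of "106A".
VL_INTERFACE = ("9", "10", "12", "15", "39", "40", "41", "80",
                "81", "83", "103", "105", "106", "106A", "107", "108")
VH_INTERFACE = ("9", "10", "11", "13", "14", "41", "42", "43",
                "84", "87", "89", "105", "108", "110", "112", "113")

INTERFACE = {"L": frozenset(VL_INTERFACE), "H": frozenset(VH_INTERFACE)}


def split_kabat(position):
    """Split a Kabat label such as '106A' into (106, 'A'); plain numbers give (n, '')."""
    digits = "".join(ch for ch in position if ch.isdigit())
    return int(digits), position[len(digits):]


def in_interface(chain, position):
    """True if the Kabat position of chain 'L' or 'H' is in the listed interface region."""
    return position.upper() in INTERFACE[chain.upper()]


print(in_interface("L", "106A"), in_interface("H", "12"))
print(sorted(VL_INTERFACE, key=split_kabat))
```

Sorting with `split_kabat` as the key keeps 106A directly after 106, which a plain string sort would not.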

In a further embodiment, said parent antibody fragment comprises a Fab fragment. In this
context, the term Fab refers to a complex comprising the VL-CL (variable and constant light)
and VH-CH1 (variable and first constant heavy) portions of the antibody molecule, and the
term interface region refers to a region in the first constant domain of the heavy chain (CH1)
which is, or would be, adjoined to the CH2 domain in a larger antibody fragment or full
antibody.

In a still further embodiment, said parent IgSF fragment is a fusion protein of any of said 
domains or fragments and another protein domain, derived from an antibody or any other 
protein or peptide. The advent of bacterial expression of antibody fragments has opened the
way to the construction of proteins comprising fusions between antibody fragments and other
molecules. A further embodiment of the present invention relates to such fusion proteins by
providing for a DNA sequence which encodes both the modified IgSF domain or fragment, as
described above, as well as an additional moiety. Particularly preferred are moieties which
have a useful therapeutic function. For example, the additional moiety may be a toxin
molecule which is able to kill cells (Vitetta et al., 1993). There are numerous examples of such
toxins, well known to those skilled in the art, such as the bacterial toxins Pseudomonas
exotoxin A and diphtheria toxin, as well as the plant toxins ricin, abrin, modeccin, saporin, and
gelonin. By fusing such a toxin to an antibody fragment, the toxin can be targeted to, for
example, diseased cells, and thereby have a beneficial therapeutic effect. Alternatively, the
additional moiety may be a cytokine, such as IL-2 (Rosenberg & Lotze, 1986), which has a
particular effect (in this case a T-cell proliferative effect) on a family of cells. In a further
preferred embodiment, the additional moiety is at least part of a surface protein which may
direct the fusion protein to the surface of an organism, for example, a cell or a phage, and
thereby displays the IgSF partner. Preferably, the additional moiety is at least part of a coat
protein of filamentous bacteriophages, most preferably of the gene III protein. In a further
embodiment, the additional moiety may confer on its IgSF partner a means of detection and/or
purification. For example, the fusion protein could comprise the modified IgSF domain or
fragment and an enzyme commonly used for detection purposes, such as alkaline phosphatase
(Blake et al., 1984). There are numerous other moieties which can be used as detection or
purification tags, which are well known to the practitioner skilled in the art. Particularly
preferred are peptides comprising at least five histidine residues (Hochuli et al., 1988), which
are able to bind to metal ions, and can therefore be used for the purification of the protein to
which they are fused (Lindner et al., 1992). Also provided for by the invention are additional
moieties such as the commonly used c-myc and FLAG tags (Hopp et al., 1988; Knappik &
Plückthun, 1994).

By engineering one or more fused additional domains. IgSF domains or fragments can be 
assembled into larger molecules which also fall under the scope of the present invention. To 
the extent that the physical properties of the IgSF domain or fragment determine the 
characteristics of the assembly, the present invention provides a means of increasing the 
solubility of such larger molecules. For example, mini-antibodies (Pack, 1994) are dimers
comprising two antibody fragments, each fused to a self-associating dimerization domain.
Dimerization domains which are particularly preferred include those derived from a leucine
zipper (Pack & Plückthun, 1992) or helix-turn-helix motif (Pack et al., 1993).

All of the above embodiments of the present invention can be effected using standard
techniques of molecular biology known to anyone skilled in the art. 

The compositions described above may have utility in any one of a number of settings. 
Particularly preferred are diagnostic and therapeutic compositions. 

The present invention also provides methods for making the compositions and compounds
comprised therein described above. Particularly preferred is a method comprising the 
following steps: 

i) analyzing the interface region of an IgSF domain for hydrophobic residues which are
solvent-exposed using either a solvent-accessibility approach (Lee & Richards, 1971),
analysis of van der Waals interactions in the interface region, or similar methods which are
well known to one skilled in the art,

ii) identifying one or more of the hydrophobic residues to be substituted by more hydrophilic
residues, or one or more positions where hydrophilic residues or amino acid stretches
enhancing the overall hydrophilicity of the interface region can be inserted into said
interface region, or one or more positions where hydrophobic residues or amino acid
stretches enhancing the overall hydrophobicity of the interface region can be deleted from
said interface region, or any combination of said substitutions, said insertions, and said
deletions to give one or more mutants of said parent IgSF domain,

iii) preparing DNA encoding mutants of said IgSF domain, characterized by the changes
identified in ii), by using e.g. conventional mutagenesis or gene synthesis methods, said
DNA being prepared either separately or as a mixture,

iv) introducing said DNA or DNA mixture in a vector system suitable for expression of said
mutants,

v) introducing said vector system into suitable host cells and expressing said mutant or
mixture of mutants,

vi) identifying and characterizing mutants which are obtained in higher yield in soluble form,
and

vii) if necessary, repeating steps iii) to vi) to increase the hydrophilicity of said identified
mutant or mutants further.
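
Steps ii) and iii) imply a set of candidate mutants that grows quickly with the number of positions considered. A minimal sketch of how such a set could be enumerated is given below; it is an illustration only, restricted to substitutions. The replacement residues are the preferred hydrophilic amino acids named earlier, the two example positions are taken from the candidates discussed in the Results, and the function name, label format and `max_sites` parameter are assumptions.

```python
from itertools import combinations, product

# One-letter codes for Asn, Asp, Arg, Gln, Glu, Gly, His, Lys, Ser and Thr.
HYDROPHILIC = ["N", "D", "R", "Q", "E", "G", "H", "K", "S", "T"]


def enumerate_mutants(candidates, replacements, max_sites=2):
    """List substitution mutants carrying 1..max_sites changes.

    `candidates` maps a position label to its parent residue; each mutant is
    returned as a tuple of labels such as 'H84:V->D'.
    """
    positions = sorted(candidates)
    mutants = []
    for n_sites in range(1, max_sites + 1):
        for chosen in combinations(positions, n_sites):
            for new_residues in product(replacements, repeat=n_sites):
                mutants.append(tuple(
                    f"{pos}:{candidates[pos]}->{new}"
                    for pos, new in zip(chosen, new_residues)
                ))
    return mutants


candidates = {"H84": "V", "H87": "M"}   # two of the VH candidates named in the Results
small_library = enumerate_mutants(candidates, ["D", "S"])
print(small_library)
print(len(enumerate_mutants(candidates, HYDROPHILIC)))
```

With two positions and ten replacement residues the library already contains 120 single and double mutants, which is one reason why step iii) allows the DNA to be prepared as a mixture and step vi) relies on screening.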

The host referred to above may be any of a number commonly used in the production of
heterologous proteins, including but not limited to bacteria, such as E. coli (Ge et al., 1995) or
Bacillus subtilis (Wu et al., 1993), fungi, such as yeasts (Horwitz et al., 1988; Ridder et al.,
1995) or filamentous fungus (Nyyssönen et al., 1993), plant cells (Hiatt, 1990; Hiatt & Ma,
1993; Whitelam et al., 1994), insect cells (Potter et al., 1993; Ward et al., 1995), or
mammalian cells (Trill et al., 1995).

The invention also relates to a method for the production of an IgSF domain or fragment of the
invention comprising culturing a host cell of the invention and isolating said domain or
fragment.

The invention is now demonstrated by the following examples, which are presented for
illustration only and are not intended to limit the scope of the invention.



Examples

i) Abbreviations 

Abbreviations: CDR: complementarity determining region; dsFv: disulfide-linked Fv
fragment; IMAC: immobilized metal ion affinity chromatography; IPTG: isopropyl-β-D-
thiogalactopyranoside; i/s: ratio insoluble/soluble; H(X): heavy chain residue number X; L(X):
light chain residue number X; NTA: nitrilo-triacetic acid; OD550: optical density at 550 nm;
PDB: protein database; scFv: single-chain Fv fragment; SDS-PAGE: sodium dodecyl sulfate
polyacrylamide gel electrophoresis; v/c: variable/constant; wt: wild type



ii) Material and Methods 

(a) Calculation of solvent accessibility

Solvent accessible surface areas for 30 non-redundant Fab fragments and the Fv fragments
derived from these by deleting the constant domain coordinates from the PDB file were
calculated using the latest version, as of March 1996, of the program NACCESS
(http://www.biochem.ucl.ac.uk/~roman/naccess/naccess) based on the algorithm described by
Lee & Richards (1971).

(b) scFv gene synthesis 

The single-chain Fv fragment (scFv) in the orientation VL-linker-VH of the antibody 4-4-20
(Bedzyk et al., 1990) was obtained by gene synthesis (Prodromou and Pearl, 1992). The VL
domain carries a three-amino acid long FLAG tag (Knappik and Plückthun, 1994). We have
used two different linkers with a length of 15 (Gly4Ser)3 and 30 amino acids (Gly4Ser)6,
respectively. The gene so obtained was cloned into a derivative of the vector pIG6 (Ge et al.,
1995). The mutant antibody fragments were constructed by site-directed mutagenesis (Kunkel
et al., 1987) using single-stranded DNA and up to three oligonucleotides per reaction.

(c) Expression

Growth curves were obtained as follows: 20 ml of 2xYT medium containing 100 µg/ml
ampicillin and 25 µg/ml streptomycin were inoculated with 250 µl of an overnight culture of
E. coli JM83 harboring the plasmid encoding the respective antibody fragment and incubated
at 24.5°C until an OD550 of 0.5 was reached. IPTG (Biomol Feinchemikalien GmbH) was
added to a final concentration of 1 mM and incubation was continued for 3 hours. The OD
was measured every hour, as was the β-lactamase activity in the culture supernatant to
quantify the degree of cell leakiness. Three hours after induction an aliquot of the culture was
removed and the cells were lysed exactly as described by Knappik and Plückthun (1995). The
β-lactamase activity was measured in the supernatant, in the insoluble and in the soluble
fraction. The fractions were assayed for antibody fragments by reducing SDS-PAGE, with the
samples normalized to OD and β-lactamase activity to account for possible plasmid loss as
well as for cell leakiness. The gels were blotted and immunostained using the FLAG antibody
M1 (Prickett et al., 1989) as the first antibody and an Fc-specific anti-mouse antiserum
conjugated to horseradish peroxidase (Pierce) as the second antibody, using a
chemoluminescent detection assay described elsewhere (Ge et al., 1995).

(d) Purification

Mutant scFv fragments were purified by a two-column procedure. After French press lysis of
the cells, the raw E. coli extract was first purified by IMAC (Ni-NTA superflow, Qiagen)
(20 mM HEPES, 500 mM NaCl, pH 6.9; step gradient of imidazole 10, 50 and 200 mM)
(Lindner et al., 1992) and, after dialyzing the IMAC eluate against 20 mM MES pH 6.0,
finally purified by cation exchange chromatography (S-Sepharose fast flow column,
Pharmacia) (20 mM MES, pH 6.0; salt gradient 0-500 mM NaCl). Purity was controlled by
Coomassie stained SDS-PAGE. The functionality of the scFv was tested by competition
ELISA.

Because of its very poor solubility in the periplasmic system, the wt 4-4-20 was expressed as
cytoplasmic inclusion bodies in the T7-based system (Studier & Moffatt, 1986; Ge et al.,
1995). The refolding procedure was carried out as described elsewhere (Ge et al., 1995). For
purification, the refolding solution (2 l) was loaded over 10 h without prior dialysis onto a
fluorescein affinity column, followed by a washing step with 20 mM HEPES, 150 mM NaCl,
pH 7.5. Two column volumes of 1 mM fluorescein (sodium salt, Sigma Chemicals Co.) pH
7.5 were used to elute all functional scFv fragment. Extensive dialysis (7 days with 12 buffer
changes) was necessary to remove all fluorescein. All purified scFv fragments were tested in
gel filtration (Superose-12 column, Pharmacia SMART-System, 20 mM HEPES, 150 mM
NaCl, pH 7.5).

(e) KD determination by fluorescence titration

The concentrations of the proteins were determined photometrically using an extinction 
coefficient calculated according to Gill and von Hippel (1989). Fluorescence titration
experiments were carried out by taking advantage of the intensive fluorescence of fluorescein.
Two ml of 20 mM HEPES, 150 mM NaCl, pH 7.5 containing 10 or 20 nM fluorescein were
placed in a cuvette with integrated stirrer. The excitation wavelength was 485 nm; emission
spectra were recorded from 490 to 530 nm. Purified scFv (in 20 mM HEPES, 150 mM NaCl,
pH 7.5) was added in 5 to 100 µl aliquots, and after a 3 min equilibration time a spectrum was
recorded. All spectra were recorded at 20°C. The maximum of emission at 510 nm was used
for determining the degree of complexation of scFv to fluorescein, seen as quenching as a
function of the concentration of the antibody fragment. The KD value was determined by
Scatchard analysis.
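
The Scatchard step can be sketched as a straight-line fit. For a single class of binding sites, bound/free = (Bmax - bound)/KD, so a plot of bound/free against bound has slope -1/KD. The code below is a minimal illustration under that one-site assumption; the function name and the synthetic data (generated from a chosen KD of 2.0 in arbitrary concentration units) are hypothetical and are not measurements from this work.

```python
def scatchard_kd(bound, free):
    """Estimate KD from a least-squares line through (bound, bound/free) points."""
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return -1.0 / slope   # Scatchard slope is -1/KD


# Synthetic one-site binding data: bound = Bmax * free / (KD + free).
true_kd = 2.0
b_max = 10.0
free = [0.5, 1.0, 2.0, 4.0, 8.0]
bound = [b_max * f / (true_kd + f) for f in free]

print(scatchard_kd(bound, free))
```

In a titration such as the one described, the bound concentration would be derived from the degree of quenching and the free concentration from the total scFv added minus the bound amount; that conversion is not shown here.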

(f) Equilibrium denaturation measurement

Equilibrium denaturation curves were obtained by denaturation of 0.2 µM protein in HEPES
buffered saline (HBS) buffer (20 mM HEPES, 150 mM NaCl, 1 mM EDTA, pH 7.5) and
increasing amounts of urea (1.0-7.5 M; 20 mM HEPES, 150 mM NaCl, pH 7.4; 0.25 M steps)
in a total volume of 1.7 ml. After incubating the samples at 10°C for 12 hours and an
additional 3 hours at 20°C prior to measurements, the fluorescence spectra were recorded at
20°C from 320-360 nm with an excitation wavelength of 280 nm. The emission wavelength of
the fluorescence peak shifted from 341 to 347 nm during denaturation and was used for
determining the fraction of unfolded molecules. Curves were fitted according to Pace (1990).
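
A minimal sketch of the two quantities involved is given below. It assumes a two-state model with a linear dependence of the unfolding free energy on urea concentration, which is the usual linear-extrapolation treatment, and it assumes that the peak position can be mapped linearly onto the fraction unfolded between the 341 nm and 347 nm end points. The function names and the parameter values used in the example (20 kJ/mol and 5 kJ/(mol M)) are hypothetical.

```python
import math

GAS_CONSTANT = 8.314e-3  # kJ/(mol K)


def fraction_from_peak(peak_nm, native_nm=341.0, unfolded_nm=347.0):
    """Linear mapping of the emission peak position onto the fraction unfolded."""
    return (peak_nm - native_nm) / (unfolded_nm - native_nm)


def fraction_unfolded(urea_molar, dg_water, m_value, temperature_k=293.15):
    """Two-state model: dG = dG(water) - m * [urea], K = exp(-dG/RT), f = K/(1+K)."""
    dg = dg_water - m_value * urea_molar
    k_unfold = math.exp(-dg / (GAS_CONSTANT * temperature_k))
    return k_unfold / (1.0 + k_unfold)


# Hypothetical parameters: the transition midpoint lies at dg_water / m_value = 4 M urea.
for urea in (1.0, 4.0, 7.5):
    print(urea, round(fraction_unfolded(urea, dg_water=20.0, m_value=5.0), 3))
print(fraction_from_peak(344.0))
```

Fitting would then consist of adjusting `dg_water` and `m_value` until the model curve matches the fractions derived from the measured peak positions.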

(g) Thermal denaturation

For measuring the thermal denaturation rates, purified scFv was dissolved in 2 ml HBS buffer
to a final concentration of 0.5 mM. The aggregation was followed for 2.5 h at 40°C and at
44°C by light scattering at 400 nm.

iii) Results 

(a) Comparison of known antibody sequences 

Compared to other domain/domain interfaces in proteins, the interface between
immunoglobulin variable and constant domains is not very tightly packed. A comparison of
30 non-redundant Fab structures in the PDB database showed that between the light chain
variable and constant domain an area of 410 ± 90 Å² per domain is buried, while the heavy
chain variable and constant domains interact over an area of 710 ± 180 Å². Some, but not all
of the interface residues are hydrophobic, predominantly aliphatic. Generally, sequence
conservation of the residues contributing to the v/c domain interface is not particularly high.
Still, the v/c domain interface shows up as a marked hydrophobic patch on the surface of an
Fv fragment (Fig. 1).

Solvent accessible surface areas for 30 non-redundant Fab fragments and their corresponding 
Fv fragments (derived from the Fab fragment by deleting the constant domain coordinates 
from the PDB file) were calculated using the program NACCESS (Lee & Richards, 1971). 
Residues participating in the v/c domain interface were identified by comparing the 
solvent-accessible surface area of each amino acid side chain in the context of an Fv fragment to its 
accessible surface in the context of an Fab fragment. Figure 2 shows a plot of the relative 
change in side chain accessibility upon deletion of the constant domains as a function of 
sequence position. Residues which show a significant reduction of side chain accessibility are 
also highlighted in the sequence alignment. To assess sequence variability in the positions 
identified in Figure 2, the variable domain sequences collected in the Kabat database (status 
March 1996) were analyzed (Table 1). Of the 15 interface residues identified in the VL 
domain of the antibody 4-4-20 (Fig. 1 and Table 1), L9(leu), L12(pro), L15(leu), L40(pro), 
L83(leu), and L106(ile) are hydrophobic and therefore candidates for replacement. Of the 16 
interface residues in the VH domain, H11(leu), H14(pro), H41(pro), H84(val), H87(met) and 
H89(ile) were identified as possible candidates for substitution by hydrophilic residues in the 
scFv fragment of the antibody 4-4-20 (Fig. 1 and Table 1). 
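
The selection step described above can be sketched in a few lines of Python. This is a minimal illustration, not the procedure actually used: it assumes that relative side chain accessibilities (in percent) have already been computed, for example with NACCESS, for the same variable domain in the Fab and in the Fv context. The function name, the 20% cut-off and all accessibility numbers are hypothetical and chosen only for illustration.

```python
# Hypothetical sketch: flag residues whose side chains become exposed when
# the constant domains are deleted (Fab -> Fv), then keep the hydrophobic ones.

HYDROPHOBIC = {"ala", "val", "leu", "ile", "met", "phe", "trp", "pro"}


def interface_candidates(acc_fab, acc_fv, residue_type, min_gain=20.0):
    """Return hydrophobic positions whose relative side chain accessibility
    (percent) rises by at least `min_gain` on going from Fab to Fv.

    acc_fab, acc_fv: dict position -> relative accessibility in percent
    residue_type:    dict position -> three-letter residue name (lower case)
    min_gain:        illustrative cut-off, not a value from the text
    """
    hits = []
    for pos in sorted(acc_fv):
        # Positions missing from the Fab data are treated as unchanged.
        gain = acc_fv[pos] - acc_fab.get(pos, acc_fv[pos])
        if gain >= min_gain and residue_type[pos] in HYDROPHOBIC:
            hits.append((pos, residue_type[pos], gain))
    return hits


# Made-up accessibilities for three heavy chain positions (illustration only).
acc_fab = {"H11": 10.0, "H84": 15.0, "H85": 60.0}
acc_fv = {"H11": 55.0, "H84": 70.0, "H85": 62.0}
types = {"H11": "leu", "H84": "val", "H85": "glu"}

candidates = interface_candidates(acc_fab, acc_fv, types)
for pos, res, gain in candidates:
    print(f"{pos} ({res}): +{gain:.0f}% accessible in the Fv")
```

With these invented numbers only the two hydrophobic positions that gain accessibility are reported; the hydrophilic position is skipped because its accessibility barely changes and it is not hydrophobic.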

Not all of these hydrophobic residues are equally good candidates for replacement, however. 
While residues which are hydrophobic in one particular sequence but hydrophilic in many 
other sequences may appear most attractive, the conserved hydrophobic residues listed in 
Table 1 have also been investigated, since the evolutionary pressure which kept these 
residues conserved acted on the Fab fragment within the whole antibody, but not on the 
isolated Fv portion. In this study, we did not replace the proline residues, since pro L40 and 
pro H41 form the hairpin turns at the bottom of the framework II region, while the conserved 
VL cis-proline L8 and proline residues H9 and H14 determine the shape of framework I of the 
immunoglobulin variable domains. 

Excluding prolines, this leaves residues L9 (leu in 4-4-20, ser in most kappa chains), L15 (leu, 
usually hydrophobic), L83 (leu, usually val or phe) and L106 (ile, as in 86% of all kappa 
chains) in the VL domain and H11 (leu, as in 60% of all heavy chains), H84 (val, in other VH 
domains frequently ala or ser), H87 (met, usually ser) and H89 (ile, most frequently val) in 
VH as possible candidates for replacement in the 4-4-20 scFv fragment. 

(b) Mutations in the 4-4-20 scFv 

For the 4-4-20 scFv fragment some of the crucial residues identified in the sequence analysis 
described above are already hydrophilic, but nevertheless 9 residues are of hydrophobic nature 
(including pro 12 in the light chain) (Table 1). We chose three residues for closer analysis by 
mutation. 

Leu15 in VL is a hydrophobic amino acid in 98% of all kappa chains (Table 1). Leu11 is 
conserved in VH (Table 1) and is involved in v/c interdomain contacts (Lesk & Chothia, 
1988). In contrast, valine occurs very infrequently at position H84; threonine or serine and 
alanine are mainly found at this position (Table 1). As can be seen in Figure 1, val84 
contributes to a large hydrophobic patch at the newly exposed surface of VH. All three 
positions were mutated into acidic residues, and L11 was also changed to asparagine (Table 
2). 

The scFv fragment was expressed and tested with two different linkers, the 15-mer linker 
(Gly4Ser)3 (Huston et al., 1995) and the same motif extended to 30 amino acids, (Gly4Ser)6. 
All mutations were tested in both constructs. The in vivo effects of the different mutations on 
solubility were identical, and therefore only the results for the 30-mer linker are described in 
more detail. The periplasmic expression experiments were carried out at 24.5°C, and all 
constructs were tested for soluble and insoluble protein by immunoblotting. The ratio of 
insoluble to soluble (i/s) protein was determined for every mutant. In Figure 3 A-D, insoluble 
(lane 1) and soluble (lane 2) fractions of the wt scFv are shown. Nearly no soluble material 
occurs in periplasmic expression, which is consistent with previous reports of Bedzyk et al. 
(1990) and Denzin et al. (1991), who described that the periplasmic expression of the 
wt scFv leads mainly to periplasmic inclusion bodies. 

The single point mutation L15E in VL (Flu1) shows no effect on the ratio i/s when compared 
with the wt (Fig. 3A, lane 3, 4). Mutating leu at position 11 in the heavy chain to asparagine 
(Flu2) also shows nearly no effect compared to the wt, whereas the substitution with aspartic 
acid (Flu3) shifts the i/s ratio towards more soluble protein, but this effect is still not very 
dramatic. In contrast, the point mutation at position 84 (Flu4, Fig. 3B, lane 3, 4 and Fig. 3D, 
lane 3, 4) had a dramatic influence on the solubility of the scFv fragment of the antibody 
4-4-20. The ratio i/s is changed to about 1:1, resulting in a 25-fold increase of soluble protein 
compared to the wt. 
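
As a rough consistency check on these numbers, the soluble fraction follows from the i/s ratio as s/(i+s) = 1/(1 + i/s). The sketch below is illustrative arithmetic only. The helper name is hypothetical, the wt ratio is back-calculated from the stated 25-fold increase rather than taken from the text, and the calculation assumes that total expression (insoluble plus soluble) is the same for the wt and the mutant.

```python
def soluble_fraction(i_over_s):
    """Fraction of total protein that is soluble for a given i/s ratio."""
    return 1.0 / (1.0 + i_over_s)


# Flu4 (V84D): i/s of about 1:1, so about half of the protein is soluble.
flu4 = soluble_fraction(1.0)

# Assumption: equal total expression, so a 25-fold increase in soluble
# protein means the wt soluble fraction is 25 times smaller.
wt = flu4 / 25.0

# Invert the formula to obtain the i/s ratio implied for the wt.
wt_i_over_s = 1.0 / wt - 1.0

print(f"Flu4 soluble fraction: {flu4:.2f}")
print(f"implied wt soluble fraction: {wt:.2f}")
print(f"implied wt i/s ratio: about {wt_i_over_s:.0f}:1")
```

Under these assumptions a wt soluble fraction of only a few percent is implied, which agrees qualitatively with the observation that nearly no soluble wt material is seen on the blot.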

The combination of V84D with L11N or L11D (Flu5, Flu6) also changes the ratio i/s 
compared to the wt, but the ratio is not improved further compared to V84D alone (Fig. 3B). 
Interestingly, the combination of Flu5 with the light chain mutation at position 15 (Flu9) leads 
to less soluble material (Fig. 3C, lane 7, 8) than Flu5 itself (Fig. 3B, lane 5, 6). The negative 
influence of the L15E mutation can also be seen in Flu8 (Fig. 3C, lane 5, 6) compared with 
Flu3 (Fig. 3A, lane 7, 8). In Fig. 3D the comparison of the wt (lane 1, 2 and 5, 6) and Flu4 
(lane 3, 4 and 7, 8) is shown in both the 15-mer and the 30-mer construct. 

The negative effect of L15E can be rationalized by looking at a model of the 4-4-20 scFv 
fragment. L15 forms a hydrophobic pocket together with residues A80, L83, and L106. 
Apparently, L15 stabilizes the scFv fragment by hydrophobic interactions with its closest 
neighbours. Thus the exchange L15E, intended to make the scFv fragment more hydrophilic 
and more soluble, is made at the expense of fragment stability. The analysis of hydrophobic 
interactions within a fragment should therefore be used to choose the solvent-exposed residues 
to be mutated in the case of any other antibody fragment. 

Combinations of various serine mutations in VH led to further improvements in the i/s ratio. 
The mutants FH15 (V84S, M87S, I89S) and FH20 (L11S, V84S, M87S, I89S) both showed 
more than 70% of soluble protein in immunoblots (data not shown). 



(c) Functional expression and purification 

The oligomerization of scFv fragments as a function of linker length has been investigated 
previously. A continuous decrease in the amount of dimer and multimer formation as a 
function of linker length has been reported (Desplancq et al., 1994; Whitlow et al., 1994). 
While the (Gly4Ser)3 linker has been shown to lead to monomeric scFvs in many cases in the 
VH-VL direction, this is often not the case in the VL-VH direction. This is caused by an 
asymmetry in the VL/VH arrangement, leading to a longer distance between the end of VH 
and the N-terminus of VL than between C-terminus of VL and N-terminus of VH (Huston et 
al., 1995). Consequently, a linker of identical length may lead to different properties of the 
resulting molecules. 

Since we have chosen to use the minimal perturbation FLAG (Knappik & Plückthun, 1994) at 
the N-terminus of VL in our constructs and thus the VL-linker-VH orientation, we have 
investigated the use of longer linkers. In the periplasmic expression in E. coli no difference 
between the 15-mer and the 30-mer linker in the corresponding mutants is visible (Fig. 3D), 
but when we attempted to purify the two Flu4 scFvs with long and short linker, a big 
discrepancy between the two constructs was found. The purification of the Flu4 mutant 
(V84D) with the 15-mer linker leads to very small amounts of partially purified protein (about 
0.015 mg per liter and OD, estimated from SDS-PAGE after IMAC purification), whereas the 
30-mer linker construct gives about 0.3 mg per liter and OD of highly pure functional protein. 
All mutants with the 30-mer linker were tested in gel filtration and found to be monomeric 
(data not shown). 

For further in vitro characterization five mutants were purified with the 30-mer linker: V84D 
(Flu4), V84D/L11D (Flu6), L11D (Flu3), and the serine mutants FH15 and FH20 (see iii(b)). 
A two-step chromatography, first using IMAC and then cation-exchange chromatography, led 
to homogeneous protein. The i/s ratio of the antibody fragments (Fig. 3) was also reflected in 
the purification yield of functional protein. The highly soluble mutant Flu4 (V84D) (Fig. 3B, 
lane 3, 4) yielded about 0.3 mg purified and functional protein per liter and OD, Flu6 
(L11D/V84D) (Fig. 3B, lane 7, 8) yielded about 0.25 mg per liter and OD, and Flu3 (less 
soluble material on the blot in Fig. 3A, lane 7, 8) yielded 0.05 mg per liter and OD. The serine 
mutants FH15 and FH20 yielded 0.3 mg and 0.4 mg per liter and OD, respectively. The wt 
scFv of the antibody 4-4-20 did not give any soluble protein at all in periplasmic expression 
with either linker, and it was therefore expressed as cytoplasmic inclusion bodies, followed by 
refolding in vitro and fluorescein affinity chromatography. The refolded wt scFv was shown 
by gel filtration to be monomeric with the 30-mer linker (data not shown). 

(d) Biophysical properties of the mutant scFvs 

Since we changed amino acids which are conserved, it cannot be excluded that changes at 
these positions may be transmitted through the structure and have an effect on the binding 
constant, even though they are very far from the binding site (Chatellier et al., 1996). To 
eliminate this possibility, we determined the binding constant of the mutants Flu3, Flu4, Flu6 
and the wt scFv. Fluorescence titration was used to determine KD in solution by using the 
quenching of the intrinsic fluorescence of fluorescein when it binds to the antibody. The 
fluorescence quenching at 510 nm was measured as a function of added scFv. The KD values 
(Table 3 and Fig. 4) obtained for all three mutant scFvs and the wt scFv are very similar and 
correspond very well to the recently corrected KD of the monoclonal antibody 4-4-20 
(Miklasz et al., 1995). 
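
The evaluation of such a titration can be sketched as follows. The saturation r = (F - F0)/(Finf - F0) is computed from the measured fluorescence F, the fluorescence without antibody F0 and the fluorescence at large antibody excess Finf; for 1:1 binding, a plot of r/[free antibody] against r is a straight line with slope -1/KD. The code is a minimal sketch under these assumptions and is not the evaluation actually used. All function names are hypothetical, and the titration data are simulated with an arbitrary KD of 1 nM that is not a value from Table 3.

```python
import math


def saturation(f, f0, f_inf):
    """Fractional saturation of fluorescein, r = (F - F0) / (Finf - F0)."""
    return (f - f0) / (f_inf - f0)


def scatchard_kd(r_values, antibody_total, fluorescein_total):
    """Estimate KD from the slope of r/[free antibody] versus r
    (slope = -1/KD), assuming one fluorescein per binding site."""
    xs, ys = [], []
    for r, ab in zip(r_values, antibody_total):
        ab_free = ab - r * fluorescein_total  # antibody not bound to fluorescein
        xs.append(r)
        ys.append(r / ab_free)
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return -1.0 / slope


def simulated_signal(ab, ligand, kd, f0, f_inf):
    """Noise-free fluorescence for 1:1 binding (only used to make test data)."""
    s = ab + ligand + kd
    bound = (s - math.sqrt(s * s - 4.0 * ab * ligand)) / 2.0
    return f0 + (bound / ligand) * (f_inf - f0)


# Hypothetical numbers: KD = 1 nM, F0 = 100 and Finf = 5 (arbitrary units).
FLUORESCEIN = 20.0  # nM
antibody = [4.0, 10.0, 20.0, 40.0, 100.0, 400.0, 800.0]  # nM
signal = [simulated_signal(ab, FLUORESCEIN, 1.0, 100.0, 5.0) for ab in antibody]
r_values = [saturation(f, 100.0, 5.0) for f in signal]

kd_estimate = scatchard_kd(r_values, antibody, FLUORESCEIN)
print(f"estimated KD: {kd_estimate:.3f} nM")
```

Because the simulated data are noise-free, the fit simply recovers the KD used for the simulation; with real data the scatter of the points determines the error of the estimate.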

To determine whether the mutations had an influence on the thermodynamic stability of the 
protein, we determined the equilibrium unfolding curves by urea denaturation. The V84D 
mutant and the wt scFv were used for this analysis, and an overlay plot is shown in Figure 5. 
The midpoint of both curves is at 4.1 M urea. Both curves were fitted by an algorithm for a 
two-state model described by Pace (1990), but the apparent small difference between the V84D 
mutant and the wt scFv is not statistically significant. 
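
A two-state unfolding curve of this kind can be sketched as follows, assuming the common linear extrapolation form in which the free energy of unfolding decreases linearly with urea concentration, dG = dG(H2O) - m [urea], so that the midpoint lies at dG(H2O)/m. This is an illustration only and not the fitting procedure used here. The m value is hypothetical; only the 4.1 M midpoint is taken from the text.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol K)


def fraction_unfolded(urea_molar, dg_water, m_value, temp_kelvin=298.15):
    """Fraction of unfolded molecules in a two-state model with a linear
    dependence of the unfolding free energy on urea concentration."""
    dg = dg_water - m_value * urea_molar           # kcal/mol at this urea
    k_unfold = math.exp(-dg / (R_KCAL * temp_kelvin))
    return k_unfold / (1.0 + k_unfold)


M_VALUE = 2.0                    # kcal/(mol M), hypothetical
MIDPOINT = 4.1                   # M urea, midpoint stated in the text
DG_WATER = M_VALUE * MIDPOINT    # free energy in water implied by the two values

for urea in (0.0, 3.0, 4.1, 5.0, 8.0):
    f_u = fraction_unfolded(urea, DG_WATER, M_VALUE)
    print(f"{urea:.1f} M urea: fraction unfolded = {f_u:.3f}")
```

At the midpoint the free energy of unfolding is zero and half of the molecules are unfolded, independent of the m value chosen; the m value only sets the steepness of the transition.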

Aggregation of folding intermediates could be another explanation for the different in vivo 
results between the mutant scFvs and the wt scFv (Fig. 3). In the periplasm of E. coli, the 
protein concentrations are assumed to be rather high (van Wielink & Duine, 1990) and the 
aggregation effects could thus be pronounced. In order to estimate the aggregation behavior in 
vitro, we have measured the thermal aggregation rates at different temperatures. In Figure 6 it 
is clearly seen that the wt scFv is already aggregating significantly at 44°C, whereas the 
mutant V84D tends to aggregate more slowly. The wt scFv is thus clearly more aggregation 
prone than the mutant scFv. This is very similar to the observations made with different 
mutations on the antibody McPC603 (Knappik and Plückthun, 1995), where no correlation 
was found between equilibrium denaturation curves and expression behavior, but a good 
correlation was found with the thermal aggregation rates. 

Figures and Tables 

Figure 1: Space-filling representation of the Fv fragment of the antibody 
4-4-20 

Figure 2: Variable/constant domain interface residues for VL (2a) and VH (2b). For 30 
non-redundant Fab fragments taken from the Brookhaven Databank, the solvent 
accessible surface of the amino acid side chains was calculated in the context of 
an Fv and of an Fab fragment. The plot shows the relative reduction in accessible 
surface upon contact with the constant domains (overlay plot for all 30 Fv 
fragments). In the sequence alignment, residues contributing to the v/c interface 
are highlighted. The symbols indicate the relative reduction of solvent accessible 
surface upon removing the constant domains (symbols: no symbol < 1%; • < 
20%; ■ < 40%; ▲ < 60%; T < 80%; and ♦ > 80%). Circles indicate those 
positions which are further analyzed (see Table 1). 

Figure 3: Western blots showing the insoluble (i) and soluble (s) fractions of cell extracts, 
prepared as described in Material and Methods, expressing the scFv fragments of 
the antibody 4-4-20. The amino acids substituted in the various mutants are given 
in Table 2. 

Figure 4: Scatchard plot of the fluorescence titration of fluorescein (20 nM) with antibody 
(4 to 800 nM), measured at 510 nm. The value r was obtained from (F-F0)/(F∞-F0), 
where F is the measured fluorescein fluorescence at a given antibody 
concentration, F0 is the fluorescence in the absence of antibody and F∞ when 
antibody is present in large excess. Note that r gives the saturation of fluorescein 
by antibody. (a) Titration of wt scFv, (b) titration of Flu4 (V84D). 

Figure 5: An overlay plot of the urea denaturation curves is shown. (x) wt scFv, (o) Flu4. 

Figure 6: Thermal denaturation time courses at 40 and 44°C for wt and Flu4 scFv fragment 
are shown. (a) wt scFv at 40°C, (b) Flu4 at 40°C, (c) Flu4 at 44°C, (d) wt scFv at 
44°C. 

Table 1: Sequence variability of residues contributing to the v/c interface: Residue statistics 
are based on the variable domain sequences in the Kabat database (March 1996). 
Sequences which were <90% complete were excluded from the analysis. Number 
of sequences analyzed: human VL kappa: 404 of 881, murine VL kappa: 1061 of 
2239, human VL lambda: 223 of 409, murine VL lambda: 71 of 206, human VH: 
663 of 1756, murine VH: 1294 of 3849. Position refers to the sequence position 
according to Kabat et al. (1991), %exp. (Fab) to the relative side chain accessibility 
in an Fab fragment as calculated by the program NACCESS (NACCESS v2.0 by 
Simon Hubbard (http://www.biochem.ucl.ac.uk/~roman/naccess/naccess.html)), 
%exp. (ind.) to the relative side chain accessibility in the isolated VL or VH 
domain, %buried to the relative difference in side chain accessibility between Fv 
and Fab fragment, Consensus refers to the sequence consensus, and Distribution 
to the distribution of residue types. 

Table 2: Mutations introduced in the scFv fragment of the antibody 4-4-20: Each line 
represents a different protein carrying the mutations indicated. The residues are 
numbered according to Kabat et al. (1991). 

Table 3: KD values of the different scFv mutants determined in fluorescence titration: The 
KD values are given in nM; the error was calculated from the Scatchard analysis 
(Fig. 4). # determined by Miklasz et al. (1995) 

References 

Better, M., Chang, P., Robinson, R. & Horwitz, A.H. (1988). E. coli secretion of an active 
chimeric antibody fragment. Science 240, 1041-1043. 

Bird, R.E., Hardman, K.D., Jacobson, J.W., Johnson, S., Kaufman, B.M., Lee, S.M., Lee, T., 
Pope, S.H., Riordan, G.S. & Whitlow, M. (1988). Single-chain antigen-binding 
proteins [published erratum appears in (1989) Science 244, 409]. Science 242, 
423-6. 

Bedzyk, W.D., Weidner, K.M., Denzin, L.K., Johnson, L.S., Hardman, K.D., Pantoliano, 
M.W., Asel, E.D. & Voss, E.W., Jr. (1990). Immunological and structural 
characterization of a high affinity anti-fluorescein single-chain antibody. J. Biol. 
Chem. 265, 18615-18620. 



SUBSTITUTE SHEET (RULE 26) 



wo 98/02462 PCT/EP97/03792 

-21- 

Blake, M.S., Johnston, K.H., Russell-Jones, G.J. & Gotschlich, E.C. (1984). A rapid, sensitive 
method for detection of alkaline phosphatase-conjugated anti-antibody on Western 
blots. Anal. Biochem. 136, 175-179. 

Brinkmann, U., Reiter, Y., Jung, S., Lee, B. & Pastan, I. (1993). A recombinant immunotoxin 
containing a disulfide-stabilized Fv fragment. Proc. Natl. Acad. Sci. U.S.A. 90, 
7538-7542. 

Carter, P., Kelley, R.F., Rodrigues, M.L., Snedecor, B., Covarrubias, M., Velligan, M.D., 
Wong, W.L.T., Rowland, A.M., Kotts, C.E., Carver, M.E., Yang, M., Bourell, 
J.H., Shepard, H.M. & Henner, D. (1992). High level Escherichia coli expression 
and production of a bivalent humanized antibody fragment. Bio/Technology 10, 
163-167. 

Casari, G. & Sippl, M.J. (1992). Structure-derived hydrophobic potential. Hydrophobic 
potential derived from X-ray structures of globular proteins is able to identify 
native folds. J. Mol. Biol. 224, 725-32. 

Chatellier, J., van Regenmortel, M.H.V., Vernet, T. & Altschuh, D. (1996). Functional 
mapping of conserved residues located at the VL and VH domain interface of an 
Fab. J. Mol. Biol., in press. 

Denzin, L.K., Whitlow, M. & Voss, E.W., Jr. (1991). Single-chain site-specific mutation of 
fluorescein-amino acid contact residues in high affinity monoclonal antibody 
4-4-20. J. Biol. Chem. 266, 14095-14102. 

Desplancq, D., King, D.J., Lawson, A.D. & Mountain, A. (1994). Multimerization behaviour 
of single chain Fv variants for the tumour-binding antibody B72.3. Protein Eng. 7, 
1027-1033. 

Ge, L., Knappik, A., Pack, P., Freund, C. & Plückthun, A. (1995). Expressing antibodies in 
Escherichia coli. Antibody Engineering. A Practical Approach (Ed. C.A.K. 
Borrebaeck). IRL Press, Oxford, pp. 229-266. 

Gething, M.J. & Sambrook, J. (1992). Protein folding in the cell. Nature 355, 33-45. 

Gill, S.C. & von Hippel, P.H. (1989). Calculation of protein extinction coefficients from 
amino acid sequence data. Anal. Biochem. 182, 319-326. 

Glockshuber, R., Malia, M., Pfitzinger, I. & Plückthun, A. (1992). A comparison of strategies 
to stabilize immunoglobulin Fv-fragments. Biochemistry 29, 1362-1366. 

Hiatt, A. (1990). Antibodies produced in plants. Nature 344, 469-470. 




Hiatt, A. & Ma, J.K. (1993). Characterization and applications of antibodies produced in 
plants. Int. Rev. Immunol. 10, 139-152. 

Ho, S.N., Hunt, H.D., Horton, R.M., Pullen, J.K. & Pease, L.R. (1989). Site-directed 
mutagenesis by overlap extension using the polymerase chain reaction. Gene 77, 
51-9. 

Hochuli, E., Bannwarth, W., Döbeli, H., Gentz, R. & Stüber, D. (1988). Genetic approach to 
facilitate purification of recombinant proteins with a novel metal chelate 
adsorbent. Bio/Technology 6, 1321-1325. 

Hopp, T.P., Prickett, K.S., Price, V.L., Libby, R.T., March, C.J., Cerretti, D.P., Urdal, D.L. & 
Conlon, P.J. (1988). A short polypeptide marker sequence useful for recombinant 
protein identification and purification. Bio/Technology 6, 1204-1210. 

Horwitz, A.H., Chang, C.P., Better, M., Hellstrom, K.E. & Robinson, R.R. (1988). 
Secretion of functional antibody and Fab fragment from yeast cells. Proc. Natl. 
Acad. Sci. U.S.A. 85, 8678-8682. 

Huston, J., George, A.J.T., Tai, M., McCartney, J.E., Jin, D., Segal, D.M., Keck, P. & 
Oppermann, H. (1995). Single-chain Fv design and production by preparative 
folding. Antibody Engineering. A Practical Approach (Ed. C.A.K. Borrebaeck). 
IRL Press, Oxford, pp. 185-228. 

Huston, J.S., Levinson, D., Mudgett-Hunter, M., Tai, M.S., Novotny, J., Margolies, M.N., 
Ridge, R.J., Bruccoleri, R.E., Haber, E. & Crea, R. (1988). Protein engineering of 
antibody binding sites: recovery of specific activity in an anti-digoxin single-chain 
Fv analogue produced in Escherichia coli. Proc. Natl. Acad. Sci. U.S.A. 85, 
5879-83. 

Johnson, K.S., Jackson, R.H. & Chiswell, D.J. (1992). Binding Domains. PCT Application 
WO 92/01787. 

Kabat, E.A., Wu, T.T., Perry, H.M., Gottesman, K.S. & Foeller, C. (1991). Sequences of 
proteins of immunological interest. U.S. Dept. of Health and Human Services, 
Public Health Service, National Institutes of Health. NIH Publication 91-3242. 

Knappik, A. & Plückthun, A. (1994). An improved affinity tag based on the FLAG peptide for 
detection and purification of recombinant antibody fragments. BioTechniques 17, 
754-761. 




Knappik, A. & Plückthun, A. (1995). Engineered turns of a recombinant antibody improve its 
in vivo folding. Protein Eng. 8, 81-89. 

Kunkel, T.A., Bebenek, K. & McClary, J. (1991). Efficient site-directed mutagenesis using 
uracil-containing DNA. Methods in Enzymol. 204, 125-139. 

Kunkel, T.A., Roberts, J.D. & Zakour, R.A. (1987). Rapid and efficient site-specific 
mutagenesis without phenotypic selection. Methods in Enzymol. 154, 367-382. 

Lee, B. & Richards, F.M. (1971). The interpretation of protein structures: estimation of static 
accessibility. J. Mol. Biol. 55, 379-400. 

Lesk, A.M. & Chothia, C. (1988). Elbow motion in the immunoglobulins involves a 
molecular ball-and-socket joint. Nature (London) 335, 188-190. 

Leistler, B. & Perham, R.N. (1994). Solubilizing buried domain proteins: A self-assembling 
interface domain from glutathione reductase. Biochemistry 33, 2773-2781. 

Lindner, P., Guth, B., Wülfing, C., Krebber, C., Steipe, B., Müller, F. & Plückthun, A. (1992). 
Purification of native proteins from the cytoplasm and periplasm of Escherichia 
coli using IMAC and histidine tails: a comparison of proteins and protocols. 
Methods: A Companion to Methods Enzymol. 4, 41-56. 

Miklasz, S.D., Gulliver, G.A. & Voss, E.W., Jr. (1995). High-affinity rat anti-fluorescein 
monoclonal antibody with unique fine specificity properties including differential 
recognition of dynamic ligand analogues. J. Mol. Recognition 8, 258-269. 

Munro, S. & Pelham, H.R.B. (1986). An Hsp70-like protein in the ER: identity with the 78 kd 
glucose-regulated protein and immunoglobulin heavy chain binding protein. Cell 
46, 291-300. 

Nozaki, Y. & Tanford, C. (1971). The solubility of amino acids and two glycine peptides in 
aqueous ethanol and dioxane solutions. Establishment of a hydrophobicity scale. 
J. Biol. Chem. 246, 2211-7. 

Nyyssönen, E., Penttilä, M., Harkki, A., Saloheimo, A., Knowles, J.K. & Keränen, S. (1993). 
Efficient production of antibody fragments by the filamentous fungus 
Trichoderma reesei. Bio/Technology 11, 591-595. 

Pace, C.N. (1990). Measuring and increasing protein stability. Trends Biotechnol. 8, 93-98. 

Pack, P. & Plückthun, A. (1992). Miniantibodies: use of amphipathic helices to produce 
functional, flexibly linked dimeric Fv fragments with high avidity in Escherichia 
coli. Biochemistry 31, 1579-1584. 




Pack, P., Kujau, M., Schroeckh, V., Knüpfer, U., Wenderoth, R., Riesenberg, D. & Plückthun, 
A. (1993). Improved bivalent miniantibodies, with identical avidity as whole 
antibodies, produced by high cell density fermentation of Escherichia coli. 
Bio/Technology 11, 1271-1277. 

Pack, P. (1994). Mini-Antikörper: Bivalente, tetravalente und bispezifische Immunglobuline 
aus E. coli. Ph.D. thesis. Ludwig-Maximilians-Universität München. 

Plückthun, A. (1992). Mono- and bivalent antibody fragments produced in Escherichia coli: 
engineering, folding and antigen binding. Immunol. Rev. 130, 151-188. 

Plückthun, A., Krebber, A., Krebber, C., Horn, U., Knüpfer, U., Wenderoth, R., Nieba, L., 
Proba, K. & Riesenberg, D. (1996). Producing antibodies in Escherichia coli: 
From PCR to fermentation. A practical approach. Antibody Engineering (Ed. J. 
McCafferty). IRL Press, Oxford, pp. 203-252. 

Potter, K.N., Li, Y. & Capra, J.D. (1993). Antibody production in the baculovirus expression 
system. Int. Rev. Immunol. 10, 103-112. 

Prickett, K.S., Amberg, D.C. & Hopp, T.P. (1989). A calcium-dependent antibody for 
identification and purification of recombinant proteins. BioTechniques 7, 580-589. 

Prodromou, C. & Pearl, L.H. (1992). Recursive PCR: a novel technique for total gene 
synthesis. Protein Eng. 5, 827-829. 

Ridder, R., Schmitz, R., Legay, F. & Gram, H. (1995). Generation of rabbit monoclonal 
antibody fragments from a combinatorial phage display library and their 
production in the yeast Pichia pastoris. Bio/Technology 13, 255-260. 

Rose, G.D. & Wolfenden, R. (1993). Hydrogen bonding, hydrophobicity, packing, and protein 
folding. Annu. Rev. Biophys. Biomol. Struct. 22, 381-415. 

Rosenberg, S.A. & Lotze, M.T. (1986). Cancer immunotherapy using interleukin-2 and 
interleukin-2 activated lymphocytes. Annu. Rev. Immunol. 4, 681-709. 

Skerra, A. & Plückthun, A. (1988). Assembly of a functional immunoglobulin Fv fragment in 
Escherichia coli. Science 240, 1038-1041. 

Studier, F.W. & Moffatt, B.A. (1986). Use of bacteriophage T7 RNA polymerase to direct 
selective high-level expression of cloned genes. J. Mol. Biol. 189, 113-130. 

Trill, J.J., Shatzman, A.R. & Ganguly, S. (1995). Production of monoclonal antibodies in 
COS and CHO cells. Curr. Opin. Biotechnol. 6, 553-560. 




Trower, M.K. (1994). Site-directed mutagenesis using a uracil-containing phagemid template. 
Methods Mol. Biol. 31, 67-77. 

Ullrich, H.D., Patten, P.A., Yang, P.L., Romesberg, F.E. & Schultz, P.G. (1995). Expression 
studies of catalytic antibodies. Proc. Natl. Acad. Sci. USA 92, 11907-11911. 

Van Wielink, J.E. & Duine, J.A. (1990). How big is the periplasmic space. Trends Biochem. 
Sci. 15, 136-137. 

Vitetta, E.S., Thorpe, P.E. & Uhr, J. (1993). Immunotoxins: magic bullets or misguided 
missiles. Immunol. Today 14, 253-259. 

Viville, S. (1994). Site-directed mutagenesis using a double-stranded DNA template. Methods 
Mol. Biol. 31, 57-65. 

Ward, V.K., Kreissig, S.B., Hammock, B.D. & Choudary, P.V. (1995). Generation of an 
expression library in the baculovirus expression vector system. J. Virol. Methods 
53, 263-272. 

Whitelam, G.C., Cockburn, W. & Owen, M.R. (1994). Antibody production in transgenic 
plants. Biochem. Soc. Trans. 22, 940-944. 

Whitlow, M., Filpula, D., Rollence, M.L., Feng, S. & Wood, J.F. (1994). Multivalent Fvs: 
characterization of single-chain Fv oligomers and preparation of a bispecific Fv. 
Protein Eng. 7, 1017-1026. 

Wu, X.C., Ng, S.C., Near, R.I. & Wong, S.L. (1993). Efficient production of a functional 
single-chain antidigoxin antibody via an engineered Bacillus subtilis 
expression-secretion system. Bio/Technology 11, 71-76. 






Claims 

1. A DNA sequence which encodes an immunoglobulin superfamily (IgSF) domain or 
fragment which differs from a parent IgSF domain or fragment in that the region which 
comprised or would comprise the interface with a second domain adjoined to said parent 
IgSF domain or fragment within the chain of a larger IgSF fragment or protein is made 
more hydrophilic by modification. 

2. The DNA sequence according to claim 1 in which said modification is substitution of one 
or more amino acids at said interface with amino acids which are more hydrophilic. 

3. The DNA sequence according to claim 1 in which said modification is insertion of one or 
more hydrophilic amino acids in said interface, or insertion of amino acids which increase 
the overall hydrophilicity in said interface, or deletion of one or more hydrophobic amino 
acids in said interface, or deletion of amino acids, said deletion leading to an increase in 
the overall hydrophilicity in said interface. 

4. The DNA sequence according to claim 1 in which said modification consists of any two 
or more of: 

a) substitution of one or more amino acids at said interface with amino acids which are 
more hydrophilic, 

b) insertion of one or more hydrophilic amino acids in said interface, or insertion of 
amino acids which increase the overall hydrophilicity in said interface, 

c) deletion of one or more hydrophobic amino acids in said interface, or deletion of 
amino acids, said deletion leading to an increase in the overall hydrophilicity in said 
interface. 

5. The DNA sequence according to any of claims 2 to 4 in which said substituted or inserted 
amino acid is taken from the list Asn, Asp, Arg, Gln, Glu, Gly, His, Lys, Ser, and Thr. 

6. The DNA sequence according to any of claims 1 to 5 in which said parent IgSF domain is 
part of an IgSF fragment. 

7. The DNA sequence according to any of claims 1 to 6 in which said domain or fragment is 
derived from an antibody. 

8. The DNA sequence according to claim 7 in which said fragment is a Fab fragment. 

9. The DNA sequence according to claim 7 in which said fragment is an Fv fragment. 

10. The DNA sequence according to claim 7 in which said fragment is a scFv fragment. 

11. The DNA sequence according to claim 7 in which said fragment is an Fv stabilized by an 
inter-domain disulphide bond. 

12. The DNA sequence according to any of claims 9 to 11 in which said interface region 
comprises residues 9, 10, 12, 15, 39, 40, 41, 80, 81, 83, 103, 105, 106, 106A, 107, 108 for 
VL, and residues 9, 10, 11, 13, 14, 41, 42, 43, 84, 87, 89, 105, 108, 110, 112, 113 for VH. 

13. The DNA sequence according to any of claims 1 to 12, having a contiguous sequence 
which encodes one or more additional moieties. 

14. The DNA sequence according to claim 13 in which at least one of said additional moieties 
is a toxin, a cytokine, or a reporter enzyme. 

15. The DNA sequence according to claim 13 in which at least one of said additional moieties 
is at least part of a surface protein of an organism. 

16. The DNA sequence according to claim 15 in which said organism is a filamentous 
bacteriophage. 

17. The DNA sequence according to claim 16 in which said surface protein is the gene III 
protein. 

18. The DNA sequence according to claim 13 in which at least one of said additional moieties 
is capable of binding a metal ion. 

19. The DNA sequence according to claim 18 in which at least one of said additional moieties 
comprises at least five histidines. 

20. The DNA sequence according to claim 13 in which said moiety is a peptide. 

21. The DNA sequence according to claim 20 in which said peptide is a labelling tag. 

22. The DNA sequence according to claim 21 in which said labelling tag is c-myc or FLAG. 

23. The DNA sequence according to claim 20 in which said peptide comprises an association 
domain which results in self-association of two or more of said antibody fragments. 

24. The DNA sequence according to claim 23 in which said association domain is derived 
from a leucine zipper or from a helix-turn-helix motif. 

25. The DNA sequence according to claim 20 in which said peptide comprises a first 
association domain which results in hetero-association of one or more of said antibody 
fragments with one or more peptides or proteins comprising a second hetero-association 
domain being able to associate with said first hetero-association domain. 

26. A vector comprising a DNA sequence according to any of claims 1 to 25. 

27. A host cell comprising a vector according to claim 26. 

28. An IgSF domain or fragment, or a fusion protein comprising an IgSF domain or fragment, 
encoded by a DNA sequence according to any of claims 1 to 25, by a vector according to 
claim 26, or produced by a host cell according to claim 27. 

29. A diagnostic composition comprising an IgSF domain or fragment, or a fusion protein 
comprising an IgSF domain or fragment, according to claim 28. 

30. A therapeutic composition comprising an IgSF domain or fragment, or a fusion protein 
comprising an IgSF domain or fragment, according to claim 28. 

31. A method for deriving a DNA sequence according to any of claims 1 to 25 which 
comprises the following steps: 

i) analyzing the interface region of a parent IgSF domain for hydrophobic residues 
which are solvent-exposed, 

ii) identifying one or more of the hydrophobic residues to be substituted by more 
hydrophilic residues, or one or more positions where hydrophilic residues or amino 
acid stretches enhancing the overall hydrophilicity of the interface region can be 
inserted into said interface region, or one or more positions where hydrophobic 
residues or amino acid stretches enhancing the overall hydrophobicity of the interface 
region can be deleted from said interface region, or any combination of said 
substitutions, said insertions, and said deletions to give one or more mutants of said 
parent IgSF domain. 
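
Steps i) and ii) of claim 31 can be illustrated by a minimal computational sketch. The sketch is not part of the claimed method and rests on several assumptions that do not come from this document: the Kyte-Doolittle hydropathy scale is used as the measure of hydrophobicity, the interface positions and relative solvent exposures are supplied by the user (for example from a structural model), positions are counted sequentially from 1 rather than by Kabat numbering, and the function names, thresholds and toy sequence are hypothetical. Only substitutions are covered, not insertions or deletions.

```python
# Kyte-Doolittle hydropathy values (higher = more hydrophobic).
# Assumption: this scale stands in for "hydrophobic" as used in the claim.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def find_exposed_hydrophobic(sequence, interface_positions, exposure,
                             min_hydropathy=1.5, min_exposure=0.3):
    """Step i): list (position, residue) pairs in the interface region
    that are hydrophobic and solvent-exposed.

    Positions are 1-based sequential indices (hypothetical convention).
    `exposure` maps position -> relative solvent accessibility (0..1).
    Both thresholds are illustrative assumptions.
    """
    hits = []
    for pos in sorted(interface_positions):
        residue = sequence[pos - 1]
        if (KYTE_DOOLITTLE[residue] >= min_hydropathy
                and exposure.get(pos, 0.0) >= min_exposure):
            hits.append((pos, residue))
    return hits


def propose_substitutions(sequence, hits, replacements=("D", "E", "N")):
    """Step ii): build mutant sequences in which each flagged residue is
    replaced by a more hydrophilic one. Returns {mutation name: sequence}."""
    mutants = {}
    for pos, residue in hits:
        for new in replacements:
            if KYTE_DOOLITTLE[new] < KYTE_DOOLITTLE[residue]:
                name = f"{residue}{pos}{new}"
                mutants[name] = sequence[:pos - 1] + new + sequence[pos:]
    return mutants


# Toy input: a short illustrative sequence with made-up exposure values.
toy_sequence = "EVQLVESGGGLVQPGG"
toy_interface = {9, 11, 13}
toy_exposure = {9: 0.8, 11: 0.6, 13: 0.5}

toy_hits = find_exposed_hydrophobic(toy_sequence, toy_interface, toy_exposure)
toy_mutants = propose_substitutions(toy_sequence, toy_hits)
for mutation, mutant_sequence in sorted(toy_mutants.items()):
    print(mutation, mutant_sequence)
```

In this toy input only the leucine at sequential position 11 is both hydrophobic and exposed, so the sketch proposes replacing it with aspartate, glutamate or asparagine. Any real application would need residue numbering and exposure data appropriate to the IgSF domain in question.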

32. A method for making an IgSF domain or fragment, or a fusion protein comprising an IgSF 
domain or fragment, according to claim 28 which comprises the following steps: 

i) deriving a DNA sequence according to claim 31, 

ii) preparing DNA encoding said mutant or mutants, said DNA being prepared either 
separately or as a mixture, 

iii) introducing said DNA or DNA mixture in a vector system suitable for expression 
of said mutant or mutants, said vector system optionally comprising one or more 
additional DNA sequences suitable for expression of additional IgSF domains or 
fragments, or one or more DNA sequences suitable for expression of a fusion protein 
comprising said mutant or mutants, or any combination of said additional DNA 
sequences, 

iv) introducing said vector system into suitable host cells and expressing said mutant or 
mixture of mutants, or expressing said mutants or mixture of mutants in combination 
with the expression products of said additional DNA sequences, 

v) identifying and characterizing one or more mutants, alone or in said combination, 
which are obtained in higher yield in soluble form, and 

vi) if necessary, repeating steps ii) to vi) to increase the hydrophilicity of said 
identified mutant or mutants, alone or in said combination, further. 

33. The method according to claim 32 in which said host is a bacterium, a fungus, a plant, an 
insect cell, or a cell-line of mammalian origin. 

34. A method for the production of an IgSF domain or fragment of claim 28 comprising 
culturing a host cell of claim 27 and isolating said domain or fragment. 

Figure 1: Space filling representation of the Fv fragment of the 
antibody 4-4-20 
Figure 2a: Variable/constant domain interface residues for VL 
Figure 2b: Variable/constant domain interface residues for VH 
Figure 3: Western blots showing the insoluble (i) and soluble (s) 
fractions of cell extracts 
Figure 4: Scatchard plots of fluorescence titration of fluorescein 
with antibody: a) Titration of wt scFv; 
b) Titration of Flu4 (V84D) 
Figure 5: Overlay plot of urea denaturation. (x) wt scFv, (o) 
Flu4 
Figure 6: Thermal denaturation time courses at 40°C and 44°C 
for wt and Flu4 scFv fragments 
Table 1: Sequence variability of residues contributing to the v/c 
interface 
Table 2: Mutations introduced in the scFv fragment of the 
antibody 4-4-20 

L15E (VL) L11N (VH) L11D (VH) V84D (VH) 

Flu 1 
Flu 2 
Flu 3 
Flu 4 • 
Flu 5 
Flu 6 • • 
Flu 7 • • 
Flu 8 • • 
Flu 9 • • 
Flu 4 short • 



Table 3: Kd values of the different scFv mutants determined in 
fluorescence titration 

            Kd (nM) 
Flu wt      80 ± 7 
Flu 3       60 ± 12 
Flu 4       70 ± 10 
Flu 6       75 ± 13 
Flu wt#     90 